Picture for Jie Zhao

Jie Zhao

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Add code
Jan 28, 2026
Viaarxiv icon

M2I2HA: Multi-modal Object Detection Based on Intra- and Inter-Modal Hypergraph Attention

Add code
Jan 24, 2026
Viaarxiv icon

M2I2HA: A Multi-modal Object Detection Method Based on Intra- and Inter-Modal Hypergraph Attention

Add code
Jan 21, 2026
Viaarxiv icon

FastStair: Learning to Run Up Stairs with Humanoid Robots

Add code
Jan 15, 2026
Viaarxiv icon

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Add code
Dec 31, 2025
Viaarxiv icon

AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis

Add code
Dec 29, 2025
Viaarxiv icon

DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation

Add code
Nov 17, 2025
Viaarxiv icon

Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap

Add code
Sep 16, 2025
Figure 1 for Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap
Figure 2 for Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap
Figure 3 for Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap
Figure 4 for Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap
Viaarxiv icon

GDLLM: A Global Distance-aware Modeling Approach Based on Large Language Models for Event Temporal Relation Extraction

Add code
Aug 28, 2025
Viaarxiv icon

Progressive Bird's Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey

Add code
Aug 11, 2025
Viaarxiv icon